Guess Where? Actor-Supervision for Spatiotemporal Action Localization

نویسندگان

  • Victor Escorcia
  • Cuong D. Dao
  • Mihir Jain
  • Bernard Ghanem
  • Cees Snoek
چکیده

This paper addresses the problem of spatiotemporal localization of actions in videos. Compared to leading approaches, which all learn to localize based on carefully annotated boxes on training video frames, we adhere to a weakly-supervised solution that only requires a video class label. We introduce an actor-supervised architecture that exploits the inherent compositionality of actions in terms of actor transformations, to localize actions. We make two contributions. First, we propose actor proposals derived from a detector for human and non-human actors intended for images, which is linked over time by Siamese similarity matching to account for actor deformations. Second, we propose an actor-based attention mechanism that enables the localization of the actions from action class labels and actor proposals and is end-to-end trainable. Experiments on three human and non-human action datasets show actor supervision is state-of-the-art for weakly-supervised action localization and is even competitive to some fullysupervised alternatives.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Practical Dramaturgy for Actors through Theatrical Production Procedures

Dramaturgy, as a creative and critical act dependent on the theoretical and practical knowledge of theater, consists of two parts with Greek roots: Drama (action) and Ourgia (work and operation). Ourgia literally means to practice, act, and in other words process a raw material. Eugenio Barba divides Dramaturgy in to three parts: Actor Dramaturgy, Director Dramaturgy and Audience Dramaturgy. Th...

متن کامل

Actor-independent action search using spatiotemporal vocabulary with appearance hashing

Human actions in movies and sitcoms usually capture semantic cues for story understanding, which offer a novel search pattern beyond the traditional video search scenario. However, there are great challenges to achieve action-level video search, such as global motions, concurrent actions, and actor appearance variances. In this paper, we introduce a generalized action retrieval framework, which...

متن کامل

Improved Estimation in Rayleigh type-II Censored Data under a Bounded Loss Utilizing a Point Guess Value

‎The problem of shrinkage testimation (test-estimation) for the Rayleigh scale‎ ‎parameter θ based on censored samples under the reflected‎ ‎gamma loss function is considered‎. We obtain the minimum risk‎ ‎estimator among a subclass and compute its risk‎. ‎A shrinkage‎ ‎testimator based on acceptance or rejection of a null hypothesis&lr...

متن کامل

Hierarchical curiosity loops and active sensing

A curious agent acts so as to optimize its learning about itself and its environment, without external supervision. We present a model of hierarchical curiosity loops for such an autonomous active learning agent, whereby each loop selects the optimal action that maximizes the agent's learning of sensory-motor correlations. The model is based on rewarding the learner's prediction errors in an ac...

متن کامل

Monitoring Erlang/OTP Applications using Multiparty Session Types

The actor model has emerged as a programming paradigm particularly suited to programming concurrent and distributed systems. Programming languages based on the actor model consist of lightweight processes which do not communicate using shared memory, relying instead on explicit message passing. Consequently, programming languages built on the actor model avoid many of the pitfalls associated wi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018